
AGGGAGAGTCTGCCCACAAGTTTTTGTATATTTTCTCACTGAGGCATCTATTCA6TTTGGGCAGCAGACA 

CTGAGCAGAACGTAGCACGGCAATGCTTGGTAGCAATGCCTGTCCGGCCAGCACTCAGAAGACGGAGGCA 

GGAGAATCATAGCTTCCAGTCAGCCTCTTCTACAATATAGTCAGTTGGAAGTCAGCCAGCTTAGACAACA 

TGGAGAGCCTGTGCCGAAAGCCACTGGGTAAGCCCGAATCTCAGTAGCAGAGAGCTGCCCAGGGTGCGTA 

CTGC : AAAAAAAAAACCTCAAACAACAGAAGTAGGGAGGTGTAAAATAAAGTGTAGGGGGGTGGAATTTA 

AGCTGATGTGGACTTCCAAATAAAGTTACCTTTTAGATACCTATTTAAATCAATAGCATAGACCTGAAAC 

TGTCTATCAGAAAATGTGTCTATTCTGAGGAAGGAGTGCTAACGAGGTTCTGTGAGGGGGGCCTCTGGCT 

TTGAGAGGGTGTACCATCACATAAGACTCCTAAAAGCACATACTTTTATAAATTCACCATGAGCTTTAAC 

ATCTTCTTTGTCATTTCGCAGACTGAGCCATGGAGTCTTTCGATGCTGACACCAATTCAACTGACCTACA 

CTCACGGCCTCTGTTTCAACCCCAAGACATTGCCTCCATGGTCATTCTTGGTCTCACTTGTCTATTGGGA 

CTGCTAGGCAATGGGCTGGTGCTGTGGGTAGCTGGCGTAAAGATGAAGACGACCGTGAACACAGTCTGGT 

TCCTCCATCTCACCCTGGCCGATTTCCTCTGCTGCCTCTCCTTGCCCTTCTCCTTGGCTCACCTGATTCT 

CCAAGGACACTGGCCCTATGGCTTGTTCCTGTGCAAACTTATCCCATCCATCATTATTCTCAACATGTTT 

GCCAGTGTCTTCCTGCTTACTGCCATTAGCCTGGACCGATGTCTGATAGTACATAAGCCAATCTGGTGCC 

AGAATCATCGAAACGTGAGAACCGCCTTCGCCATCTGTGGATGTGTCTGGGTGGTAGCCTTTGTGATGTG 

TGTGCCCGTATTTGTATACCGTGATCTGTTCATTATGGACAATCGCAGTATATGTAGATATAATTTTGAT 

TCCTCCAGGTCATATGATTATTGGGACTACGTGTACAAACTAAGTCTACCAGAAAGCAATTCTACTGATA 

ACTCCACTGCTCAGCTAACTGGACATATGAATGACAGGTCAGCTCCTTCCTCTGTACAGGCAAGGGATTA 

CTTTTGGACAGTTACCACTGCCCTCCAGTCACAGCCATTCCTAACATCTCCTGAAGACTCATTCTCTCTA 

GATTCAGCAAACCAACAACCCCATTATGGTGGAAAGCCTCCTAATGTCCTCACAGCCGCCGTACCCAGCG 

GGTTTCCTGTTGAAGATCGTAAATCCAATACACTGAACGCTGACGCTTTTCTCTCTGCTCACACAGAACT 

TTTCCCTACTGCTTCTAGTGGTCATTTATACCCCTATGATTTCCAGGGGGATTATGTTGACCAATTCACG 

TATGACAATCATGTGCCGACACCGCTGATGGCAATAACCATCACAAGGCTGGTGGTGGGCTTCCTGGTGC 

CGTTTTTCATCATGGTAATTTGTTACAGCCTCATCGTCTTCAGAATGCGAAAAACCAACTTCACCAAGTC 

TCGGAACAAAACCTTTCGGGTGGCTGTGGCTGTGGTCACTGTCTTTTTTATCTGCTGGACTCCATACCAT 

CTTGTCGGAGTCCTGCTATTGATTACTGATCCAGAAAGTTCCTTGGGGGAAGCTGTGATGTCCTGGGACC 

ACATGTCCATTGCTTTAGCATCTGCCAATAGTTGCTTCAACCCTTTCCTGTATGCCCTCTTGGGGAAAGA 

CTTTAGGAAGAAAGCAAGACAGTCTATAAAGGGCATTCTGGAAGCAGCCTTCAGCGAAGAGCTCACGCAC 

TCTACCAACTGTACCCAAGACAAAGCCTCTTCAAAAAGAAACAATATGAGTACAGATGTGTGAAGATGTG 

GCCCTGGGAACCTAAGCAGAGTTCTCAGGTGAACAGTGATGGATGACATGTGAGCAGGACACTTTAGACA 

ATTTGGCGACTCTCAGAGAAAGGTCTCTTATTGACATCAGCATCATTTGAAAACATTAAAGATGCAAAAT 

TTCAAGCCCCATCCCAGATGTGTTGACTCAGAATCTCTGGCCCATGGGACCAGTGTTTTAACAGGCCTTC 

TTGTTTCCATCAGTGTTAAGTTTTACCTCATTTGGCTTAGTCTATTCCCATCCCTGACTACACCATGTGC 

AATGAATAACTTTTTCATCTGTTTTCAGTATTCTTTTTTTTTCCTTAGCATCATCTAAACTTCTAGTTTG 

CATGGAAGGCTGCTCTTATTGTTCTGAATGGAAGATATTCATTTATTGTACAGTTTTGTGGTGGTGACAA 

GTGATTTTTAAGTGGGGAAAGAGACACAGTAAGAAAAGATCTATGAAAGCAGGGAGTGTTGAGTTAGAGT 

TTGACAGAACACAGTGCCAAATGCCACCCACTAAAAGCAACCTGAGATAATTCCAGTGTTCATGTGAGCA 

AGTGAGCACAGATACACATAAACACTTTCCTACTCCTGGAGTGTTTTAGAAGTTGTAGCTTGGAGCTC 

(SEQ ID N0:1) 

MESFDADTNSTDLHSRPLFQPQDIASMVILGLTCLLGLLGNGLVLWVAGVKMKTTVNTVWFLHLTLADFLCC 
LSLPFSLAHLILQGHWPYGLFLCKLIPSIIILNMFASVFLLTAISLDRCLIVHKPIWCQNHRNVRTAFAICGCVWV 
VAFVMCVPVFVYRDLFIMDNRSICRYNFDSSRSYDYWDYVYKLSLPESNSTDNSTAQLTGHMNDRSAPSSV 
QARDYFWTVTTALQSQPFLTSPEDSFSLDSANQQPHYGGKPPNVLTAAVPSGFPVEDRKSNTLNADAFLSA 
HTELFPTASSGHLYPYDFQGDYVDQFTYDNHVPTPLMAITITRLWGFLVPFFIMVICYSLIVFRMRKTNFTKS 
RNKTFRVAVAVVTVFFICWTPYHLVGVLLLITDPESSLGEAVMSWDHMSIALASANSCFNPFLYALLGKDFRK 

KARQSIKGILEAAFSEELTHSTNCTQDKASSKRNNMSTDV (SEQ ID N0:2) 
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underlined = deleted in the targeting construct (SEQ ID NO:5) 

[ ] = sequence flanking Neo insert in targeting construct (SEQ ID NO:6 and SEQ ID 

NO:7) 

AAGCCCGAATCTCAGTAGCAGAGAGCTGCCCAGGGTGCGTACTGCAAAAAAAAAACCTC 

AAACAACAGAAGTAGGGAGGTGTAAAATAAAGTGTAGGGGGGTGGAATTTAAGCTGATGT 

G^AC^CCAAATAAAGTTACCTTTTAGATACCTATTTAAATCAATAGCATAGACCTGAAA 

ggStc?^ 

aaattcaccatgagctttaacatcttctttgtcatttcgcagactgagccatggagtct^ 
^atgctgacaccaattcaactgacctacactcacggcctctgtttcaaccccaagaca 

^rTHT^GGT AGCTGGCGTAAAGATGAAGACGACCGTG AACACAGTCTGGTTCCTCCAT 
^^Ar'^r'Trta rrnaTTTCCTCTGCTGCCTrTCCTTGCCCTTCTC CTTGGCT CAC CTGAT T 

ctccSggacactggccctatcggcttgttcctgtgc^ 

TCTCAACATGTTTGCCAGTGTC TTCCTGCTTACTGCCATTAGCCTGGACCGATGTCTGAT 

SaS^gccaatctggtgccagaatcatcgaaacgtgagaaccgccttcgc^ 

TGGATGTGTCTGGGTGGTAGCCTTTGTGATGTGTGTGCCCGTATTTGTATACCGTGATCT 
GTTCATTATGGACAATCGCAGTATATGTAGATATAATTTTGATTCCTCCAGGTCATATGA 
TTATTGGGACTACGTGT] ACAAACTAAGTCTACCAGAAAGCAATTCTACTGATAACTCCA 
CTGCTCAGCTAACTGGACATATGAATGACAGGTCAGCTCCTTCCTCTGTACAGGCAAGGG 
ATTACTTTTGGACAGTTACCACTGCCCTCCAGTCACAGCCATTCCTAACATCTCCTGAAG 
ACTGATTCTCTCTAGATTCAGCAAACCAACAACCCCATTATGGTGGAAAGCCTCCTAATG 
TCCTCACAGCCGCCGTACCCAGCGGGTTTCCTGTTGAAGATCGTAAATCCAATACACTGA 

Stacccctatgatttccagggggattatgttgaccaattcacgtatgacaatcatgtgc 
SSaccgctgatggcaataaccatcacaaggctggtggtgggcttcctggtgcc^^^^ 

Stcatggtaatttgttaca^ 

Igtctcggaacaaaacctttcgggtggctgtggctgtggtcactgtctttttta^ 

ggIctcca^ccatcttgtcggagtcctgctattgattactgatccagaaa^^ 

gggaagctgtgatgtcctgggaccacatgtccattgctttagcatctc 

?caa^cSttcctgtatgccctcttggggaaagactttaggaa^ 
taaagggcattctggaagcagccttcagcgaagagctcacgcactctaccaactgtaccc 

aa^caaagcctcttcaaaaagaaacaatatgagtacagatgtgtgaagatgtggccctg 

ggaacctaagcagagttctcaggtgaacagtgatggatgacatgtgagcaggacacttta 

G^CAATTTGGCGACTCTCAGAGAAAGGTCTCTTATTGACATCAGCATCATTTGAAAACAT 

?a^Itgcaaaatttcaagccccatcccagatgtgttgactcagaatctctggcccatg 

GGACCAGTGTTTTAACAGGCCTTCTTGTTTCCATCAGTGTTAAGTTTTACCTCATTTGGC 

??agtctattcccatccctgactacaccatgtgcaatgaataactttttcatctg^^^ 

AGTATTCTTTTTTTTTCCTTAGCATCATCTAAACTTCTAGTTTGCATGGAAGGCTGCTCT 
TATTGTTCTGAATGGAAGATATTCATTTATTGTACAGTTTTGTGGTGGTGACAAGTGATT 
TTTAAGTGGGGAAAGAGACACAGTAAGAAAAGATCTATGAAAGCAGGGAGTGTTGAGTTA 
GAGTTTGACAGAACACAGTGCCAAATGCCACCCACTAAAAGCAACCTGAGATAATTCCAG 
TGTTCATGTGAGCAAGTGAGCACAGATACACATAAACACTTTCCTACTCCTGGAGTGTTT 

TAGAAGTTGTAGCTTGGAGCTC 
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Gene Sequence 
Structure * 

Size of full-length 
cDNA: 2658 bp 



663 bp 



Sequence Deleted 



859 bp 




Targeting Vector* 
(genomic sequence) 

Construct Number: 3036 



Arm Length: 
5': 3.2 kb 
3': 1.8 kb 



Targeting Vector 

- — — — - Endogenous Locus 

* Not drawn to scale 



5 + arm 



LacZ-Neo 
Cassette 



3* arm 



V probe 




5 1 > CGAGGTTCTGTGAGGGGGGCC 

TCTGGCTTTGAGAGGGTGTACCAT 

CACATAAGACTCCTAAAAGCACAT 

ACTTTTATAAATTCACCATGAGCT 

TTAACATCTTCTTTGTCATTTCGC 

AGACTGAGCCATGGAGTCTTTCGA 

TGCTGACACCAATTCAACTGACCT 

ACACTCACGGCCTCTGTTTCAACC 

CCAAGACATTG< 3 ■ 

(SEQ ID NO: 3) 



5 ' >GGCTTGTTCCTGTGCAAACTT 
ATCCCATCCATCATTATTCTCAAC 
ATGTTTGCCAGTGTCTTCCTGCTT 
ACTGCCATTAGCCTGGACCGATGT 
CTGATAGTACATAAGCCAATCTGG 
TGCCAGAATCATCGAAACGTGAGA 
ACCGCCTTCGCCATCTGTGGATGT 
GTCTGGGTGGTAGCCTTTGTGATG 
TGTGTGCCCGT< 3 1 
(SEQ ID NO: 4) 
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